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Abstract: Anaerobic digestion of highly polymerized biomass by microbial communities 
present in diverse microbial ecosystems is an indispensable metabolic process for 
biogeochemical cycling in nature and for industrial activities required to maintain a sustainable 
society. Therefore, the evaluation of the complicated microbial metabolomics presents a 
significant challenge. We here describe a comprehensive strategy for characterizing the 
degradation of highly crystallized bacterial cellulose (BC) that is accompanied by metabolite 
production for identifying the responsible biocatalysts, including microorganisms and their 
metabolic functions. To this end, we employed two-dimensional solid- and one-dimensional 
solution-state nuclear magnetic resonance (NMR) profiling combined with a metagenomic 
approach using stable isotope labeling. The key components of biocatalytic reactions 
determined using a metagenomic approach were correlated with cellulose degradation and 
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metabolic products. The results indicate that BC degradation was mediated by cellulases 
that contain carbohydrate-binding modules and that belong to structural type A. The 
degradation reactions induced the metabolic dynamics of the microbial community and 
produced organic compounds, such as acetic acid and propionic acid, mainly metabolized 
by clostridial species. This combinatorial, functional and structural metagenomic approach 
is useful for the comprehensive characterization of biomass degradation, metabolic dynamics 
and their key components in diverse ecosystems. 

Keywords: nuclear magnetic resonance (NMR)-based metabolomic approach; 
heteronuclear correlation (HETCOR); metagenomic analysis; anaerobic ecosystem; 
carbohydrate -binding module (CBM) 



1. Introduction 

Microbial metabolism of highly polymerized biomass in anaerobic environments, known as 
anaerobic microbial digestion, is one of the most significant processes on Earth [1]. For example, 
anaerobic microbial digestion is responsible for biogeochemical cycling in environments, such as 
oceans and on land, and is used for the processing of industrial organic waste and wastewater [1,2]. 
This metabolic process is indispensable for maintaining natural environments and conducting 
industrial activities. It is mediated by complex microbial ecosystems that produce biogas from certain 
short-chain fatty acids [3]. Therefore, the evaluation of the metabolic dynamics of complex microbial 
ecosystems, known as microbial metabolomics, involved in anaerobic digestion is a significant 
challenge for understanding the mechanisms of biogeochemical cycling in the environment and for 
practical utilization and optimization of industrial processes required to maintain a sustainable society. 

To characterize and evaluate biomass degradation and the metabolic dynamics involved in 
anaerobic digestion that occurs in microbial ecosystems, we focused on cellulose degradation 
combined with the characterization of the structural heterogeneity of cellulose [4,5]. To address the 
challenge of this structural heterogeneity, solid-state nuclear magnetic resonance (NMR) spectroscopy 
provides a powerful tool for characterizing the structure and dynamics of cellulose [6-8] and facilitates 
the characterization of supermolecular structures, such as crystalline and amorphous forms [9-15]. 
Further, to evaluate microbial metabolomics, solution-state NMR is a powerful tool for evaluating 
metabolic dynamics in microbial ecosystems and biological systems present in diverse 
environments [4,16-19]. Moreover, we previously developed an analytical method for microbial 
metabolomics to monitor metabolic dynamics in microbial ecosystems by correlating the relationships 
between microbial communities and their metabolic activities [20]. 

In our previous study, the anaerobic microbial digestion of bacterial cellulose (BC) was 
successfully monitored using solid-, solution- and gas-state NMR spectroscopy with stable isotope 
labeling [5]. In this anaerobic microbial digestion process of BC, the reactions were initiated from the 
BC degradation by a microbial community that existed in the ecosystem, producing their enzymes. The 
BC is a partially crystalline polymer of 1— >4-linked [3-D-glucose units and exists as crystalline types la 
and ip, as well as in an amorphous form. The supermolecular structure of cellulose polymers 
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influences its physical properties and reactivity in synthetic and biological reactions; thus, these 
complex structures make it difficult to monitor their anaerobic digestion. Therefore, solid-state NMR 
was used as the most suitable approach to characterize the structural heterogeneity and to monitor the 
structural variations of BC in the previous study [5]. In addition, the reactions of BC degradation were 
accompanied with microbial metabolisms, i.e., the metabolic products were produced from the BC by 
the microbial community. The previous study used an NMR-based metabolomic approach to 
characterize solution- and gas-state metabolites in combination with the analysis of solid-state BC, 
because triple-phase NMR spectroscopy is the only method available that analyzes the reactions of 
solid, liquid and gas phases using a single instrument and characterizes the metabolic dynamics of BC 
degradation and the production and consumption of short-chain fatty acids and methane. Although this 
triple-phase NMR approach is a powerful tool for monitoring metabolic conversions of biomass into 
biogas through short-chain fatty acids by anaerobic digestion, the microbial community and its 

IT IT 

metabolic functions responsible for the degradation and metabolism of C-labeled BC ( C-BC) were 
not identified [5]. 

In the anaerobic digestion process, the key points of the reactions and elements of biocatalytic 
transitions consist of four prominent types, i.e., the polymeric substrate (BC), the produced 
metabolites, enzymatic biocatalyst (as players of extracellular degradation) and microbial biocatalyst 
(as players of intracellular reaction and the production of enzymes and metabolites). As with the 
above-mentioned and that reported in the previous paper, BC degradation is initially induced by the 
enzymatic and microbial biocatalyst followed by the structural and conformational changes in BC and 
metabolites production. For further understanding of the biocatalytic transitions in the anaerobic 
digestion process, we focused on the characterization of the key players of enzymatic and microbial 
biocatalyst with time-dependent variations, as well as their relationship between BC degradation and 
enzymatic biocatalyst variations and produced metabolic profiles and microbial biocatalyst variations. 

We here describe the evaluation and characterization of a microbial community and its metabolic 
functions related to BC degradation in an anaerobic microbial ecosystem. For the evaluation and 
characterization of the microbial community and its metabolism, we performed metagenomic analysis. 
The metagenomic analysis enables us to obtain a huge amount of the sequencing data, not only 
taxonomic information, but also the functional properties compared to conventional microbial 
community analyses, such as denaturing gradient gel electrophoresis [4,20], and, thus, provides new 
and deep insight into the relationship between phylogenetic and functional diversity in diverse 
environments [21,22]. This was accomplished by sequencing genomic material directly obtained from 
the microbial community. To evaluate microbial metabolic functions, we focused on cellulase, which 
is one of the key enzymes for the degradation of cellulose, produced by many organisms, including 
bacteria, Archaea and fungi [23]. Cellulase plays an essential role in BC degradation in anaerobic 
ecosystems and is a representative member of the glycoside hydrolase family, which hydrolyzes the 
glycosidic bonds of carbohydrates [24] and typically contains carbohydrate-binding modules 
(CBMs) [25]. CBMs are classified by their amino acid sequence similarity from which their molecular 
function can be deduced. Therefore, we considered that it is feasible to determine the microorganisms 
involved, as well as the conserved domains of certain proteins, CBMs, in particular, which are 
associated with the process, to evaluate the degradation of supermolecular structure in BC induced by 
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the complex microbial community, including unknown and unculturable microorganisms, using their 
various catalytic enzymes accompanied by their metabolic products in anaerobic ecosystems. 

We here used the E-class web tool [26] for the quantitative analysis of taxonomy and molecular 
functionality on the basis of metagenomic datasets available in the ECOMICS web toolkit [27], which 
provides a platform for trans-omics analysis of an environmental sample [28]. E-class enables the 
biological and functional classification of particular environmental and biological processes through 
the submission of sequences in a multi-FASTA [29] format and avoids the complicated procedure of 
uploading and registering the sequences with their detailed metadata. Further, NMR spectral data were 
processed using FT2DB [28,30], which digitizes one-dimensional (ID) and two-dimensional (2D)-NMR 
spectra for statistical analysis, as well as using the statistical analysis tool, HetMap, to integrate and 
display associations between heterogeneous datasets using ECOMICS, as described previously [10]. 
The variation in data for microbial communities and CBMs classified using E-class on the basis of 
metagenomic analysis correlated with the biomass degradation profiles measured using solid-state 
NMR spectroscopy and with metabolic dynamics measured using an NMR-based metabolomic 
approach according to published studies [4,20]. These correlation analyses revealed the relationships 
between microbial functions (CBMs) and BC degradation and between the microbial community and 
its metabolic dynamics (short-chain fatty acid production from the BC biomass). 

2. Results and Discussion 

This study focused on the characterization and elucidation of biocatalytic transitions in the 
anaerobic digestion process, i.e., the degradation of a supermolecular structure in BC induced by the 
complex microbial community, including unknown and unculturable microorganisms, using their 
various catalytic enzymes accompanied by their metabolic products. In commonly-used engineering 
processes, chemical reactions can be expressed as simpler chemical equations, while the anaerobic 
digestion process of BC (polymer substrate) is represented by vastly-complicated networks of 
structural transitions of BC and metabolic (product) dynamics in the microbial community. Because 
the anaerobic digestion process is intricately involved with a complicated microbial community, 
including unknown and unculturable microorganisms, accounting for a large portion of the 
community. To evaluate the biocatalytic transitions, we performed the analysis of a microbial 
community (catalytic producers) and CBMs (catalytic initiators) responsible for BC degradation 
accompanied by the evaluation of metabolite production using metagenomic analysis along with an 
NMR-based metabolomic investigation using the ECOMICS web toolkit (Figure 1). Solid-state 
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C- H-2D heteronuclear correlation (HETCOR) and solution-state ID H-NMR spectra were applied 
to evaluate BC degradation profiles and metabolomic profiles, respectively. The variation data 
acquired for the microbial community and CBMs were associated with BC degradation and metabolite 
production profiles by heterogeneous correlation analysis using HetMap, available from the 
ECOMICS website (Figure 1). 

2.1. Characterization of Microbial BC Degradation by Using Solid-State 2D HETCOR NMR 

Although BC degradation and metabolic production profiles were characterized previously [5], we 
further analyzed the C-BC degradation profiles by solid-state 2D HETCOR NMR spectroscopy, 
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because the binned data of 2D solid-state NMR spectra enabled us to perform high resolution analysis 
in the loading plots rather than ID solid-state NMR data, as reported in previous studies [10,11]. 
Because spin- and dipolar-coupled 13 C-'H cross-peaks were observed in the 2D plane in the time 
series of HETCOR spectra (Contact Time (CT) = 50 us), we employed our original 2D binning 
method, described previously [19,31]. Using 16 time points of numerical data matrices derived from 
the HETCOR spectra, we calculated principal component analysis (PCA) for monitoring structural 
changes in 1 C-BC. The 13 C-BC degradation profiles evaluated using PCA varied until 84 h after the 
start of the experiment and then converged on a position on the PCA score plot after 84 h (Figure 2A). 



Figure 1. Diagram of the analytical strategy for characterizing bacterial cellulose (BC) 
degradation in an anaerobic ecosystem. BC degradation and metabolic processes 
(left cartoon) were characterized using solid- and solution-state NMR spectroscopy with 

1 3 

C-labeled samples (colored in yellow) followed by the digitization of the spectra using 
FT2DB in the ECOMICS web toolkit. This stable isotope labeling technique promotes the 
analysis strongly in the mixture state, which could not eliminate the originally existing 
natural abundance of 12 C materials (colored in gray), such as fermentation sludge. The 
biocatalysts, including microorganisms and proteins (carbohydrate-binding modules 
(CBMs)), characterized by metagenomic analysis, were classified using the E-class web 
toolkit. The digitized NMR data and categorized metagenomic data were correlated using 
the HetMap web toolkit in ECOMICS (right scheme). 
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Further, comparative analysis of 13 C CP-MAS and HETCOR spectra between the initial (0 h), 

13 

intermediate (60 h) and last points (120 h) revealed that the NMR signals derived from C-BC were 
barely detected, but those from lipids and proteins were additionally increased at the last point 
(Supplementary Figures SI and S2). For validation of the lipid signal, 13 C Cross Polarization-Magic 
Angle Spinning (CP-MAS) NMR measurement was performed at the two conditions of CT = 8 ms and 
1 ms. The 13 C CP-MAS NMR spectra validated that a signal of 29.6 ppm in the 13 C CP-MAS spectra 
originated from -(CHa)n- in lipids, since it was emphasized, due to the high mobility at CT = 8 ms 
rather than 1 ms (Supplementary Figure SI). These results suggest that 13 C-BC was degraded by 
certain microbes by 84 h after the start of the experiment and is consistent with previously published 
C-BC degradation profiles determined using ID solid-state NMR spectra [5]. Comparison with 50- 
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and 1,000-us of CT for the same sample suggests that microbial enzymes caused changes in the 

1 3 

tertiary structure of C-BC. Specifically, HETCOR observed at shorter CT (50 us) did not exhibit 
significant changes from the initial to intermediate points (zero and 60 h), whereas longer CT (1,000 us) 
showed significant changes in the cross-peak intensities of C6-H6/H4, C4-H4/H6 and C1-H6. Note 
that the corresponding cross-peaks exhibit a characteristic loading plot (Supplementary Figure S3B). 
This suggests that the tertiary structure of 13 C-BC, including the packing arrangement of cellulose 
chains, may change because of the activity of microbial enzymes. 



Figure 2. (A) Time-course variations in BC degradation (polymer substrate); (B) produced 
metabolites; (C) microbial community (catalytic producers); and (D) CBMs (catalytic 
initiator) during the anaerobic digestion process evaluated by principal component analysis 
(PCA). The microbial community profiles categorized according to "class" are shown in (C). 
Arrows indicate the TP1 (24 h) and TP2 (84 h). 
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(D) Catalytic initiators (CBMs) 
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2.2. Time-Course Variations in Metabolites, Microbial Community Composition and CBMs 



To characterize the transitions of metabolic products derived from 13 C-BC degradation, we analyzed the 
solution-state ID ^-NMR spectra (microbial metabolic profiles) using PCA. The PCA score plots 
show that the time-course variations in microbial metabolic profiles were observed between 0-84 h 
(time point 2, TP2) after the start of the experiment and ceased after TP2 (Figure 2B), suggesting that 
microbial metabolic processes were in progress before TP2. The comparative analysis of H-NMR 
spectra between 24 h (time point 1, TP1) and TP2 revealed that the NMR signals derived from 

13 13 

C-labeled metabolites, such as acetic acid, which were possibly derived from C-BC metabolism by 
the microbial community, were abundantly produced at TP2 (Supplementary Figure S4). 13 C-acetic acid 
levels increased with time and contributed to the separation by PCA from the results of loading plot 
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analysis (Supplementary Figure S3A). This result was consistent with the results of a previous study on 

13 1 

metabolic profiles based on C-NMR spectra [5]. Moreover, the results of solution-state H-NMR and 
solid-state HETCOR spectral profiling suggest that a metabolic process generated by the anaerobic 
microbial ecosystem was caused by C-BC degradation and was accompanied by the production of 

1 % 

metabolites, such as C-acetic acid and -propionic acid until TP2. 

To characterize the microbial community and their metabolic processes that culminated in C-BC 
degradation, metagenomic analysis was performed in combination with E-class classification and 
multivariate statistical analysis. The microbial community (class) and CBM profiles evaluated by PCA 
varied and shifted from PCI -negative to -positive directions on PCA score plots until approximately 
TP2 and then varied toward the PC2 direction without any shifts toward the PCI direction after 
TP2 (Figure 2C,D). Further, the PCA score plots of the microbial community classified according 
to phylum (Supplementary Figure S5A), order (Supplementary Figure S5B), family (Supplementary 
Figure S5C) and genus (Supplementary Figure S5D) showed the same trends in transitions of the 
microbial community profiles classified according to class. This result indicates that the microbial 
community degraded 13 C-BC and produced metabolites until approximately TP2. Moreover, we 
consider TP2 as a turning point at which microbial activities and functions changed. These trends and 
transitions are consistent with the results of solution-state 'H-NMR and solid-state HETCOR spectral 
profiling and, therefore, suggest that 13 C-BC digested by this microbial community and CBMs induced 
transitions in microbial metabolism to produce acetic acid and propionic acid until approximately TP2. 

2.3. Key Components of Biocatalytic Transitions in BC Digestion Process 

To characterize the key components of biocatalytic transitions {i.e., the responsible microbes and 
CBMs) in the BC degradation process, we analyzed a dataset that included one million sequence read 
pairs obtained at TP1 or TP2 using 16S rRNA gene databases and selecting "class" as the taxonomic 
level. The taxonomic categories with the highest numbers were Bacilli (486 hits), Thermotogae (440), 
and Clostridia (288) at TP1 (Figure 3A) and Clostridia (716), Bacilli (692) and Thermotogae (351) at 
TP2 (Figure 3D). These results indicate that the numbers of Clostridia increased during the 
experiment. Further, we analyzed the sequences using the CBM databases and selected "CBM family," 
resulting in CBM48 (688), CBM34 (136) and CBM50 (135) in TP1 (Figure 3C) and CBM3 (1247), 
CBM48 (585) and CBM6 (540) in TP2 (Figure 3F), indicating an increase in the number of proteins 
with conserved CBM3 and CBM6 domains. To search for microorganisms classified into CBM3 and 
CBM6, we analyzed the datasets selected from the analyses of CBM by selecting "Organism" in the 
"Level" column of the E-class table. These results are listed in Figure 3 and show that bacteria 
annotated as C. thermocellum increased from 56 to 1,813 hits (by a factor of 32; Figure 3B,E). 
To confirm the reliability of this analysis, we performed a preliminary calculation of the average 
lengths of individual functional CBM families and found that the lengths of all CBMs and CBM3s 
were 104 and 81 amino acid residues, respectively, indicating that the numbers of proteins with CBM3 
increased significantly from TP1 to TP2. Moreover, sequences annotated as C. thermocellum and 
CBM3 increased from TP1 to TP2, indicating that it is highly possible that bacteria related to 
C. thermocellum (classified as Clostridium, the Clostridiaceae family, and the Firmicutes phylum) 
played an important role in the biodegradation of cellulose from TP1 to TP2. However, the numbers of 
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species (rRNA gene sequences) and proteins in public databases are significantly fewer than those of 
cultured and uncultured microorganisms on the planet. Therefore, if taxonomic and functional 
information is improved and enriched in the public database, species more closely related to 
C. thermocellum may be detected. Therefore, C. thermocellum is tentatively designated as an important 
bacterium responsible for cellulose biodegradation in the present study. Nevertheless, bacteria closely 
related to C. thermocellum (or clostridial species) likely contributed significantly to 13 C-BC degradation. 

Figure 3. Classification of the microbial community and CBM profiles based on 
metagenomic analysis data using E-class. The pie charts display taxonomic classification 
determined from the sequences of 16S rRNA genes (A,D) or according to the CBMs 
database (B,E) and the results of functional classification according to the CBMs database 
(C,F) at TP1 (A-C) and TP2 (D-F). The top three categories are displayed below the 
pie charts. 
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2.4. Relationships between Metabolic Dynamics and Biocatalytic Transitions of Microbial Community 



To evaluate the relationships between microbial community profiles and metabolomic variations in 
more detail, we performed heterogeneous correlation analysis between taxonomic variations in 
metagenomic data and solution-state 'H-NMR spectra (Figure 4). Bacteria classified as class Clostridia 
and Mollicutes correlated positively with 13 C-labeled acetic acid and propionic acid, which were 
possibly derived from 13 C-BC. In contrast, bacteria classified as class Fusobacteria correlated negatively 
with propionic acid synthesis, and those classified as class Actinobacteria and Acidobacteria correlated 
negatively with ethanol synthesis. These results indicate that 13 C-acetic acid and -propionic acid were 
metabolized and synthesized by Clostridia and Mollicutes. Using correlation analysis, we previously 
demonstrated that the levels of C-acetic acid and -propionic acid increased as cellulose was degraded, 
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suggesting that the generation of 13 C-labeled organic acids was caused by the anaerobic digestion of 
1 C-cellulose [5]. Considering our current findings, we suggest that 13 C-BC was digested by clostridial 
species and Mollicutes accompanied by the production of acetic acid and propionic acid in the 
anaerobic microbial ecosystem. 

Figure 4. Heterogeneous correlation analysis among taxonomic variations categorized 
according to "class" from metagenomic data and the solution-state 'H-NMR spectra. Note 
that the metabolite annotations in purple indicate the original 12 C-derived products, whereas 
the annotations in red indicate ^C-'H J-coupled signals generated by the metabolism of 

1 3 

C-cellulose. Further, the names of bacterial classes highlighted in orange indicate 
negative or positive correlations with metabolites. A cutoff score of 0.85 was imposed on 
the correlation coefficient of the heat map data. Blue and red show negative and positive 
correlations, respectively. 



Product metabolites 
(13q/12q chemical groups from 1 H-NMR spectra) 

Propionic acid 



3 C-Acetic acid 



Phenylacetic acid 



Acetic acid 



Ethanol 



'H(ppm) 



Butyric acid 



Propionic acid 



Ethanol 



hi 



13 C-Propionicacid 



Butyric acid 



co 



O 



o 
O 



co 
o 



(o £ 
'« 2 
>- >fc 

£ "1 
O % 

"co 
'i_ 
(U 

-t— ' 

o 
co 
DO 



Bacilli 
Thermotogae 
Clostridia 
Gammaproteobacteria 
Mollicutes 
Acti no bacteria 
Synergists 
Deltaproteobacteria 
Acidobacteria 
Betaproteobacteria 
Bacteroidia 
Alphaproteobacteria 
Spirochaetes 
Cytophagia 
Nitrospira 
Chlorobia 
Flavobactena 
Defembacteres 
Sphingobacteria 
Deinococci 
Methanobacteria 
Methanomicrobia 
Erysipelotrichi 
Fibrobacteres 
Epsilonproteobacteria 
Planctomycetacia 
Thermolithobacteria 
Thermomicrobia 
Fusobacteria 
Thermodesuifobacteria 
Chlamydiae 
Ktedonobactena 
Aquificae 
Dehatococcoidetes 
Thermococci 
Dictyoglomia 
Gemmatimonadetes 
Anaerolineae 
Chloroflexi 
Caldilineae 
Thermoprotei 
Methanococci 
Halobactena 
Others 



13 C-Acetic acid 



Ethanol 



3 C-Propionicacid 



Ethanol 



'C-Propionic acid 
Propionic acid 



1.00 
0.85 



0.00 



-0.85 
— -1.00 

Correlation 
factor 



2.5. Relationships between BC Digestion and Biocatalytic Transitions of Proteins 

To evaluate the relationships between CBM variations and 13 C-BC degradation profiles in more 
detail, heterogeneous correlation analysis between functional variations (CBMs) in metagenomic data 
and the solid-state HETCOR spectra were performed (Figure 5). Proteins categorized as CBM9 and 
CBM 17 correlated positively with the crystalline structure of 1 C-BC, whereas proteins categorized as 
CBM2, CBM3, CBM20 and CBM22 correlated negatively. These results indicate that 13 C-BC 
degradation was likely related to the proteins categorized as CBM2, CBM3, CBM20 and CBM22, and 
the bacterial population who carry the proteins categorized as CBM9 and CBM 17 were decreased 
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during BC degradation. Some CBM2s and CBM3 are categorized as structural type A, but other 
CBM2s are categorized as structural type B. 

Figure 5. Heterogeneous correlation analysis among functional variations (CBMs) in 

13 

metagenomic data and structural variation ( C-BC) in solid-state heteronuclear correlation 
(HETCOR) spectra. The 2D HETCOR data were divided into sequential 5-ppm intervals 
from 58 to 108 ppm for 13 C chemical shifts and sequential 0. 5-ppm intervals from 2.5 to 
5 ppm for 'EI chemical shifts. The integrated 'H chemical shifts are arranged from right to 
left for each 13 C chemical shift for presentation as ID data. The three-dimensional structures of 
the representative structural type A [PDB:1EXG (CBM3)], B [PDB:1H6X (CBM22)], and 
C [PDB:1I82 (CBM9)] of CBMs are displayed on the right side of the correlation 
map [32-34]. (PDB is Protein Data Bank [35]. Description of structural types after "PDB:" 
shows structure ID.) Note that carbohydrate-binding surfaces are highlighted with a black 
arrow on each CBM structure. A cutoff score of 0.7 was imposed on the correlation coefficient 
of the heat map data. Blue and red show negative and positive correlations, respectively. 
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In addition, CBM17, CBM20 and CBM22 are categorized as structural type B, whereas the CBM9 
is categorized as structural type C (refer to the representative CBM architecture of structural 
types A-C, Figure 5) [25]. The type A CBMs slide and diffuse across the surface of cellulose, while 
the substrate-specific types B and C lock onto their ligands and, thus, direct the enzyme to its target 
glycosidic bonds [36-38]. The type A CBMs (CBM2 and CBM3) bind to the flat, hydrophobic 
surfaces of cellulose crystals [25,39,40]. Therefore, the proteins categorized as CBM2 and CBM3 
likely degraded the crystalline structure of BC, suggesting that they are the major contributors to the 
microbial digestion of 1 C-BC. In contrast, proteins categorized as CBM20 and CBM22 that correlated 
negatively with 13 C-BC degradation degraded amorphous cellulose and were not likely to degrade 
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crystalline BC. Apparently, the bacteria carrying type A CBMs or the bacteria that increased in number 
in the 13 C-BC digestion experiment may synthesize proteins categorized as CBM20 and CBM22. This 
would explain why CBM20 and CBM22 correlated negatively with the crystalline structure of BC, 
because CBM classification and quantification are based on sequence-based metagenomic analysis 
{i.e., an increase or decrease in the detected number of sequences depended on increases or decreases 
in the bacterial population). 

The results of HETCOR and the metabolomic analysis data show that the relative peak intensities of 
cellulose (approximately 55-110 ppm) [41^15] decreased with time, while the levels of metabolites, 
such as 13 C-acetic acid and -propionic acid increased (Supplementary Figures SI, S2 and S4). 
By integrating the taxonomic and functional classification using E-class with solid-state HETCOR 
measurements and NMR-based metabolomic profiles, we revealed that 13 C-BC was degraded mainly 
by Clostridia that produced cellulases classified as CBM2 and CBM3, accompanied by the conversion 
of 13 C-BC to 13 C-acetic acid and -propionic acid (Figure 6). On the other hand, the bacterial population 
who carry the proteins categorized as CBM9 and CBM 17 were decreased during BC degradation. 
Taken together, this study was able to characterize the structural and conformational changes in the BC 
degradation process, to identify the key microbial players and enzymes of the degradation process, to 
track and monitor the metabolic dynamics in the microbial community using stable-isotope labeling, to 
link the microbial community and activity with structural and conformational changes of BC and to 
produce metabolites from the BC. 

Figure 6. Illustration of the summarized biocatalytic transition for BC degradation in the 
anaerobic digestion ecosystem. The difference in the biocatalytic transition at TP1 and TP2 
is summarized in this figure. Chains of yellow hexagons mean 13 C-BC as the polymer 
substrate. 13 C-labeled products metabolized from 13 C-BC show up as yellow nodes in the 
structural formula. 
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Therefore, these findings indicate that monitoring biomass degradation and metabolic profiles in 
microbial ecosystems using solid- and solution-state NMR spectroscopy with isotope labeling 
technology provides a powerful tool for evaluating molecular and metabolic dynamics. By combining 
these techniques with metagenomic approaches based on next-generation sequencing, the approach 
described here may be used for functional and structural metagenomic analysis. It should also be 
capable of deciphering complicated metabolic networks and dynamics induced by biomass degradation 
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and revealing key microbes and how they function in this process, particularly when metagenomic 
datasets become more robust as metagenomic technologies improve. This will lead to a better 
understanding of the organisms involved and their mechanisms of action that contribute to 
environmental and biological processes. 

3. Experimental 

3.1. General 

The materials and samples, including C-BC produced by Gluconacetobacter xylinus used in this 
study, were prepared as described previously [5]. Briefly, powdered C-BC was incubated in stirred 
tank reactors at a constant temperature of 55 °C for 120 h under anaerobic condition with thermophilic 
anaerobic digestion sludge. Solid cellulosic samples and solutions of metabolite mixtures, as well as 
bacterial DNAs were sampled at 16 time points from 0 to 120 h during anaerobic fermentation. 
Metagenomic analyses to evaluate the microbial community and CBMs were performed using an 
Illumina GAIIx (Illumina Inc., San Diego, CA, USA). The variations in metagenomic profiles were 
analyzed using multivariate statistical analysis and correlated with BC degradation profiles determined 
from solid-state HETCOR spectra recorded on a Bruker DRX-800 spectrometer and microbial 
metabolomic profiles determined from solution-state ! H-NMR spectra recorded on a Bruker DRX-700 
spectrometer using the ECOMICS web toolkit. 

3.2. NMR Spectroscopy 

Solid-state 2D HETCOR spectra were recorded using an DRX-800 spectrometer 
(Bruker-BioSpin, Billerica, MA, USA) with a Bruker 4-mm Magic Angle Spinning (MAS) triple 
resonance probe, as described previously [5]. The MAS frequency, the contact time and the recycle 
delay for HETCOR was fixed at 12 kHz, set to 50 and 1,000 us and set to 1.2 s, respectively. Note that 
16 time points acquired with CT = 50 us were used for the PCA calculation as follows. For the 2D 
HETCOR spectra, the data were processed using the NMRPipe software [46] and reduced by 
subdividing the spectra into sequential 5-ppm regions between 13 C chemical shifts of 58-108 ppm and 
sequential 0. 5-ppm regions between l H chemical shifts of 2.5-5 ppm using FT2DB software [30] in 
the ECOMICS web toolkit. Solid-state ID CP-MAS spectra were recorded on a Bruker DRX-800 
spectrometer with CT = 8 ms and 1 ms, as described previously [5]. Solution-state ID ! H-NMR 
spectra were recorded on a Bruker AV700 spectrometer, as described previously [5]. 

3.3. DNA Extraction and Metagenomic Dataset Preparation 

Genomic DNAs were extracted from the samples according to a previous report [20]. In brief, the 
samples were centrifuged and separated into precipitates and supernatants. The precipitates were dried 
under vacuum and milled in a solution of 10% sodium dodecyl sulfate to crush the microbial cells. The 
genomic DNAs were then extracted from the crushed samples. We outsourced the preparation of the 
metagenomic dataset to RIKEN GENESIS Co. Ltd. (Yokohama, Japan). Sequence libraries comprising 
microbial sequences (averaging 180 bp) were generated from the DNA samples, and metagenomic 
datasets (valid paired-end sequences with 75 and 72 bases) were acquired from the libraries 
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using the Illumina GAIIx. We obtained sequence datasets of the individual samples, including 
approximately 1 million paired-end sequences, which are available on the E-class website [26]. 

3.4. Classification of the Microbial Community and CBMs Using the E-Class Web Tool 

For taxonomic and functional categorization using 16S and 18S rRNA genes and the CBM domain, 
we used the E-class web tool available on the ECOMICS website [27,28]. For the analysis of the 
microbial community and CBMs involved in BC degradation, paired-end sequences were employed 
without assembly into contigs using the absolute number of sequences for the taxonomic and functional 
classification based on 16S rRNA genes and CBM in the E-class databases. The classification and 
categorization of the metagenomic dataset was performed by drawing pie charts simultaneously with 
the E-class web tool. 

3.5. Statistical Analysis 

For the multivariate statistical analysis, we used the recycle delay of 2 s in the NMR measurements 
to focus on numerical analysis of the variations of "relative" abundance, especially mobile 
components, based on the intensities of NMR spectra, as described in a previous report [47], not the 
quantification of the "absolute" abundance. Solution-state NMR data were reduced by subdividing the 
spectra and were processed and normalized for statistical analysis, as described previously [5]. PCA of 
the binned data was performed using "R" software according to a previous study [48]. A 2D 
heterogeneous correlation map was calculated as an asymmetric matrix between the variation data for 
CBM categorized using E-class and biodegradation profiles of BC based on HETCOR data and 
between microbial community profiles based on metagenomic data and microbial metabolomic 
profiles obtained from solution-state 'H-NMR data analyzed using Spearman's rank correlation 
coefficient on HetMap [49] in the ECOMICS web toolkit [4,10,28]. 

4. Conclusions 

We conducted a comprehensive analysis of the degradation of 13 C-BC by a microbial community. 
This analysis characterized the metabolites produced, the CBMs of the cellulases involved and the 
bacteria that were the most likely sources of these enzymes, determined using solid- and solution-state 
NMR profiling in combination with a metagenomic approach. These analyses indicated that C-BC 
was degraded by CBM2 and CBM3 cellulases. Clostridial species, which were highly abundant, were 
the most likely sources of these cellulases and produced metabolic compounds, such as C-acetic acid 
and -propionic acid from C-BC. The NMR-based metabolomic approach in combination with 
biomass profiling using solid-state NMR and metagenomic analysis described here is useful for the 
comprehensive characterization of biomass degradation, metabolic dynamics and the responsible 
microbes present in various environmental ecosystems. This methodology promises to be useful for 
deciphering complicated metabolic processes, networks and interactions within microbial communities. 

Acknowledgments 

The authors thank Eisuke Chikayama and Yuuri Tsuboi (RIKEN) for technical assistance and 
advice on NMR measurements and metagenomic analysis. This research was supported in part by 



Metabolites 2014, 4 



49 



Grants-in-Aid for Scientific Research (C) (to J.K.) and the Advanced Low Carbon Technology Research 
and Developmental Program (ALCA to J.K.) from the Ministry of Education, Culture and Sports. 

Conflicts of Interest 

The authors declare no conflict of interest. 
References 

1. Conrad, R. The global methane cycle: Recent advances in understanding the microbial processes 
involved. Environ. Microbiol. Rep. 2009, 1, 285-292. 

2. Chynoweth, D.P.; Owens, J.M.; Legrand, R. Renewable methane from anaerobic digestion of 
biomass. Renew. Energ. 2001, 22, 1-8. 

3. Schink, B. Energetics of syntrophic cooperation in methanogenic degradation. Microbiol. Mol. 
Biol. Rev. 1997, 61, 262-280. 

4. Date, Y.; Iikura, T.; Yamazawa, A.; Moriya, S.; Kikuchi, J. Metabolic sequences of anaerobic 
fermentation on glucose-based feeding substrates based on correlation analyses of microbial and 
metabolite profiling. J. Proteome Res. 2012, 11, 5602-5610. 

5. Yamazawa, A.; Iikura, T.; Shino, A.; Date, Y.; Kikuchi, J. Solid-, solution-, and gas-state NMR 
monitoring of 13C-cellulose degradation in an anaerobic microbial ecosystem. Molecules 2013, 
18, 9021-9033. 

6. Kikuchi, J.; Asakura, T. Use of C conformation-dependent chemical shifts to elucidate the local 
structure of a large protein with homologous domains in solution and solid state. J. Biochem. 
Biophys. Methods 1999, 38, 203-208. 

7. Mao, J.D.; Holtman, K.M.; Franqui-Villanueva, D. Chemical structures of corn stover and its 
residue after dilute acid prehydrolysis and enzymatic hydrolysis: Insight into factors limiting 
enzymatic hydrolysis. J. Agric. Food Chem. 2010, 58, 11680-11687. 

8. Mao, J.D.; Schmidt-Rohr, K. Accurate quantification of aromaticity and nonprotonated 
aromatic carbon fraction in natural organic matter by 13 C solid-state nuclear magnetic resonance. 
Environ. Sci. Technol. 2004, 38, 2680-2684. 

9. Mori, T.; Chikayama, E.; Tsuboi, Y.; Ishida, N.; Shisa, N.; Noritake, Y.; Moriya, S.; Kikuchi, J. 
Exploring the conformational space of amorphous cellulose using NMR chemical shifts. 
Carbohydr. Polym. 2012, 90, 1197-1203. 

10. Ogura, T.; Date, Y.; Kikuchi, J. Differences in cellulosic supramolecular structure of compositionally 
similar rice straw affect biomass metabolism by paddy soil microbiota. PLoS One 2013, 
8, e66919. 

11. Okushita, K; Chikayama, E.; Kikuchi, J. Solubilization mechanism and characterization of the 
structural change of bacterial cellulose in regenerated states through ionic liquid treatment. 
Biomacromolecules 2012, 13, 1323-1330. 

12. Okushita, K; Komatsu, T.; Chikayama, E.; Kikuchi, J. Statistical approach for solid-state NMR 
spectra of cellulose derived from a series of variable parameters. Polym. J. 2012, 44, 895-900. 



Metabolites 2014, 4 



50 



13. Komatsu, T.; Kikuchi, J. Selective signal detection in solid-state NMR using rotor-synchronized 
dipolar dephasing for the analysis of hemicellulose in lignocellulosic biomass. J. Phys. 
Chem. Lett. 2013, 4, 2279-2283. 

14. Earl, W.L.; Vanderhart, D.L. Observations by high-resolution C-13 nuclear magnetic-resonance 
of cellulose-I related to morphology and crystal-structure. Macromolecules 1981, 14, 570-574. 

15. Vanderhart, D.L.; Atalla, R.H. Studies of microstructure in native celluloses using solid-state 
carbon-13 NMR. Macromolecules 1984, 17, 1465-1472. 

16. Everroad, R.C.; Yoshida, S.; Tsuboi, Y.; Date, Y.; Kikuchi, J.; Moriya, S. Concentration of metabolites 
from low-density planktonic communities for environmental metabolomics using nuclear magnetic 
resonance spectroscopy. J. Vis. Exp. 2012, 62, e3163. 

17. Fukuda, S.; Nakanishi, Y.; Chikayama, E.; Ohno, H.; Hino, T.; Kikuchi, J. Evaluation and 
characterization of bacterial metabolic dynamics with a novel profiling technique, real-time 
metabolotyping. PLoS One 2009, 4, e4893. 

18. Fukuda, S.; Toh, H.; Hase, K.; Oshima, K.; Nakanishi, Y.; Yoshimura, K.; Tobe, T.; Clarke, J.M.; 
Topping, D.L.; Suzuki, T.; et al. Bifidobacteria can protect host from enteropathgenic infection 
through production acetate. Nature 2011, 469, 543-547. 

19. Nakanishi, Y.; Fukuda, S.; Chikayama, E.; Kimura, Y.; Ohno, H.; Kikuchi, J. Dynamic omics 
approach identifies nutrition-mediated microbial interactions. J. Proteome Res. 2010, 10, 824-836. 

20. Date, Y.; Nakanishi, Y.; Fukuda, S.; Kato, T.; Tsuneda, S.; Ohno, FL; Kikuchi, J. New monitoring 
approach for metabolic dynamics in microbial ecosystems using stable-isotope-labeling technologies. 
J. Biosci. Bioeng. 2010, 110, 87-93. 

21. McHardy, A.C.; Rigoutsos, I. What's in the mix: Phylogenetic classification of metagenome 
sequence samples. Curr. Opin. Microbiol. 2007, 10, 499-503. 

22. Gill, S.R; Pop, M.; Deboy, R.T.; Eckburg, P.B.; Turnbaugh, P.J.; Samuel, B.S.; Gordon, J.I.; 
Relman, D.A.; Fraser-Liggett, CM.; Nelson, K.E. Metagenomic analysis of the human distal gut 
microbiome. Science 2006, 312, 1355-1359. 

23. Demain, A.L.; Newcomb, M.; Wu, J.H. Cellulase, Clostridia, and ethanol. Microbiol. Mol. 
Biol. Rev. 2005, 69, 124-154. 

24. Henrissat, B. A classification of glycosyl hydrolases based on amino acid sequence similarities. 
Biochem. J. 1991, 280, 309-316. 

25. Boraston, A.B.; Bolam, D.N.; Gilbert, H.J.; Davies, G.J. Carbohydrate -binding modules: 
Fine-tuning polysaccharide recognition. Biochem. J. 2004, 382, 769-781. 

26. E-class, ECOMICS: Web tools for environmental and metabolic systems. Available online: 
https://database.riken.jp/ecomics/eclass/ (accessed on 20 December 2013). 

27. ECOMICS, ECOMICS: Web tools for environmental and metabolic systems. Available online: 
https://database.riken.jp/ecomics/ (accessed on 20 December 2013). 

28. Ogata, Y.; Chikayama, E.; Morioka, Y.; Everroad, R.C.; Shino, A.; Matsushima, A.; Haruna, FL; 
Moriya, S.; Toyoda, T.; Kikuchi, J. ECOMICS: A web-based toolkit for investigating the biomolecular 
web in ecosystems using a trans-omics approach. PLoS One 2012, 7, e30263. 

29. Pearson, W.R.; Lipman, D.J. Improved tools for biological sequence comparison. Proc. Natl. 
Acad. Sci. USA 1988, 85, 2444-2448. 



Metabolites 2014, 4 



51 



30. FT2DB, ECOMICS: Web tools for environmental and metabolic systems. Available online: 
https://database.riken.jp/ecomics/chika/index2.html (accessed on 20 December 2013). 

31. Sekiyama, Y.; Chikayama, E.; Kikuchi, J. Profiling polar and semipolar plant metabolites 
throughout extraction processes using a combined solution-state and high-resolution magic angle 
spinning NMR approach. Anal. Chem. 2010, 82, 1643-1652. 

32. Solution structure of a cellulose-binding domain from Cellulomonas fimi by nuclear magnetic 
resonance spectroscopy. Available online: http://www.rcsb. org/pdb/explore.do?structureId=lEXG 
(accessed on 20 December 2013). 

33. The role of conserved amino acids in the cleft of the C-terminal family 22 carbohydrate binding 
module of Clostridium Theermocellum XYN10B in ligand binding. Available online: 
http://www.rcsb. org/pdb/explore.do?structureId=lH6X (accessed on 20 December 2013). 

34. Crystal structures of the family 9 carbohydrate-binding module from Thermotoga maritima 
xylanase 10A in native and ligand-bound forms. Available online: http://www.rcsb.org/pdb/ 
explore. do?structureId=H82 (accessed on 20 December 2013). 

35. Protein Data Bank. Available online: http://www.rcsb.org/pdb/home/home.do (accessed on 
20 December 2013). 

36. Gilbert, H.J. The biochemistry and structural biology of plant cell wall deconstruction. 
Plant Physiol. 2010, 153, 444-455. 

37. Jervis, E.J.; Haynes, C.A.; Kilburn, D.G. Surface diffusion of cellulases and their isolated binding 
domains on cellulose. J. Biol. Chem. 1997, 272, 24016-24023. 

38. Kellett, L.E.; Poole, D.M.; Ferreira, L.M.; Durrant, A.J.; Hazlewood, G.P.; Gilbert, H.J. Xylanase B 
and an arabinofuranosidase from Pseudomonas fluorescens subsp. cellulosa contain identical 
cellulose-binding domains and are encoded by adjacent genes. Biochem. J. 1990, 272, 369-376. 

39. Blake, A.W.; McCartney, L.; Flint, J.E.; Bolam, D.N.; Boraston, A.B.; Gilbert, H.J.; Knox, J.P. 
Understanding the biological rationale for the diversity of cellulose-directed carbohydrate-binding 
modules in prokaryotic enzymes. J. Biol. Chem. 2006, 281, 29321-29329. 

40. Shoseyov, O.; Shani, Z.; Levy, I. Carbohydrate binding modules: Biochemical properties and 
novel applications. Microbiol. Mol. Biol. Rev. 2006, 70, 283-295. 

41. Kono, H.; Erata, T.; Takai, M. Complete assignment of the CP/MAS C-13 NMR spectrum of 
cellulose IIII. Macromolecules 2003, 36, 3589-3592. 

42. Kono, H; Erata, T.; Takai, M. Determination of the through-bond carbon-carbon and carbon-proton 
connectivities of the native celluloses in the solid state. Macromolecules 2003, 36, 5131-5138. 

43. Larsson, P.T.; Hult, E.L.; Wickholm, K; Pettersson, E.; Iversen, T. CP/MAS C-13-NMR spectroscopy 
applied to structure and interaction studies on cellulose I. Solid State Nuclear Magn. Reson. 1999, 
75,31-40. 

44. Larsson, P.T.; Westermark, U.; Iversen, T. Determination of the cellulose I alpha allomorph 
content in a tunicate cellulose by CP/MAS C-13-NMR spectroscopy. Carbohydr. Res. 1995, 278, 
339-343. 

45. Larsson, P.T.; Wickholm, K.; Iversen, T. A CP/MAS C-13 NMR investigation of molecular 
ordering in celluloses. Carbohydr. Res. 1997, 302, 19-25. 

46. Delaglio, F.; Grzesiek, S.; Vuister, G.W.; Zhu, G; Pfeifer, J.; Bax, A. Nmrpipe — A multidimensional 
spectral processing system based on unix pipes. J. Biomol. NMR 1995, 6, 277-293. 



Metabolites 2014, 4 



52 



47. Harris, D.M.; Corbin, K.; Wang, T.; Gutierrez, R.; Bertolo, A.L.; Petti, C; Smilgies, D.M.; 
Estevez, J.M.; Bonetta, D.; Urbanowicz, B.R.; et al. Cellulose microfibril crystallinity is reduced 
by mutating C-terminal transmembrane region residues CESA1(A903V) and CESA3(T942I) of 
cellulose synthase. Proc. Natl. Acad. Sci. USA 2012, 109, 4098-4103. 

48. Date, Y.; Sakata, K.; Kikuchi, J. Chemical profiling of complex biochemical mixtures from 
various seaweeds. Polymer J. 2012, 44, 888-894. 

49. HetMap, ECOMICS: Web tools for environmental and metabolic systems. Available online: 
https://database.riken.jp/ecomics/chika/ (accessed on 20 December 2013). 



© 2013 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article 
distributed under the terms and conditions of the Creative Commons Attribution license 
(http://creativecommons.Org/licenses/by/3.0/). 



